Unoffical empeg BBS

Quick Links: Empeg FAQ | RioCar.Org | Hijack | BigDisk Builder | jEmplode | emphatic
Repairs: Repairs

Page 2 of 2 < 1 2
Topic Options
#257430 - 11/06/2005 11:49 Re: "HDA: Disk Error: Return_Intr" Message question [Re: mlord]
tman
carpal tunnel

Registered: 24/12/2001
Posts: 5528
Quote:
The only way to know for sure about impending drive failures is to watch the S.M.A.R.T. data from smartctl.

I've had several Maxtor drives just fall over and die even when SMART thinks everything is okay. None of the thresholds had been reached and the worst values were still high. In fact I've never had any advanced warning through SMART. The one time it actually passed one of the thresholds, the drive was already dead.

Top
#257431 - 11/06/2005 14:00 Re: "HDA: Disk Error: Return_Intr" Message question [Re: mlord]
tfabris
carpal tunnel

Registered: 20/12/1999
Posts: 31565
Loc: Seattle, WA
Quote:
S.M.A.R.T. monitors a hell of a lot more than simply a count of bad sectors, Tony.

Very true. And I agree that the SMART data is the best way to get an indication of a failing drive, when possible.
_________________________
Tony Fabris

Top
#257432 - 11/06/2005 16:45 Re: "HDA: Disk Error: Return_Intr" Message question [Re: tman]
mlord
carpal tunnel

Registered: 29/08/2000
Posts: 14478
Loc: Canada
Quote:
Quote:
The only way to know for sure about impending drive failures is to watch the S.M.A.R.T. data from smartctl.

I've had several Maxtor drives just fall over and die even when SMART thinks everything is okay.


There is nothing contradictory between those two statements.

Drives can (and do and have) suddenly stop working without any signs of "impending failure", and there's really nothing anyone (other than the manufacturer) can do about it.

Recent example being all of those flawed chips that circulated in laptop (and some desktop) drives from (I think) 2001 - 2003 or so. Just a bad batch of chips that would internally fail at some random point early in their intended lifespan. Really bit IBM quite hard, as well as some other makers.

Most recently here, I have had a nice huge 300GB Maxtor drive develop a few bad sectors. Did I toss it? Of course not, but I did run a full S.M.A.R.T. "long" test (sector scan) on it after repairing the bad sectors. The drive is fine now.

In this case, the real culprit was the USB2 enclosure it was in, which seemed to be developing some kind of issues whereby it was randomly reseting the drive mid-use, which can corrupt sectors when done in the middle of a write operation. So I've tossed the $50 enclosure in favour of a new one, and kept the rather expensive (for its time) drive.

On the other hand, I once had a 80GB notebook drive here that developed bad sectors. After repairing them, it was put back into (less critical) use, and developed more bad sectors. So I belatedly checked out the S.M.A.R.T. stuff and discovered the drive even "knew" it wasn't healthy, and it then got sent off for replacement under warranty. The one they sent back in return is still working fine.

Cheers


Edited by mlord (11/06/2005 16:49)

Top
#257433 - 02/08/2005 03:09 Re: "HDA: Disk Error: Return_Intr" Message question [Re: mlord]
tfabris
carpal tunnel

Registered: 20/12/1999
Posts: 31565
Loc: Seattle, WA
Hey, Mark...

If you had a bad sector in the section that stored the current playlist order on the scratch partition, what would the command be to zero THAT out?

For instance, this command...

dd if=/dev/zero bs=512 seek=4096 count=28672 of=/dev/hda3

... zeroes out the statistics. What would zero out the stored playlists without erasing the EQ data?

I have a bad sector on the scratch partition where the current playlist order is stored. I am planning to get a new drive once I get settled in a new address. But for now I want to force it to map out the bad sectors in that section while I wait.

(I know, I argued specifically against this earlier in this very thread. I swear: If this works for me and the drive doesn't go completely south because of it, I will FAQ this.)
_________________________
Tony Fabris

Top
#257434 - 02/08/2005 08:44 Re: "HDA: Disk Error: Return_Intr" Message question [Re: tfabris]
peter
carpal tunnel

Registered: 13/07/2000
Posts: 4172
Loc: Cambridge, England
Quote:
If you had a bad sector in the section that stored the current playlist order on the scratch partition, what would the command be to zero THAT out?

Down-down-down. You don't specifically care about the zeroes, you just care that it gets written. Selecting a new playlist will write all the sectors you care about. If selecting a new playlist doesn't clear the error, your disk must have run out of replacement sectors in that zone (or there's something else going terribly wrong), and writing zeroes instead won't help.

Peter

Top
#257435 - 02/08/2005 12:44 Re: "HDA: Disk Error: Return_Intr" Message question [Re: peter]
tfabris
carpal tunnel

Registered: 20/12/1999
Posts: 31565
Loc: Seattle, WA
Damn.

Looks like I need to spend the cash on the new hard disk, then.
_________________________
Tony Fabris

Top
#257436 - 08/08/2005 00:44 Re: "HDA: Disk Error: Return_Intr" Message question [Re: tfabris]
mlord
carpal tunnel

Registered: 29/08/2000
Posts: 14478
Loc: Canada
try zeroing the partition first. The player code does single sector I/O usually on the scratch partition, and that isn't good enough since the drive itself only reads/writes tracks.

Cheers

Top
#257437 - 08/08/2005 05:55 Re: "HDA: Disk Error: Return_Intr" Message question [Re: mlord]
tfabris
carpal tunnel

Registered: 20/12/1999
Posts: 31565
Loc: Seattle, WA
Okay, I'll give it a shot while I wait for the new drive to arrive. It means losing my EQ settings, but I can write those down.

If I recall, what I lose by zeroing the whole partition is:

- EQ settings
- Play counts, track overviews, and other per-track statistics
- The current running order/playlist (this is the thing that's not getting saved properly so it's already gone)

Anything else?
_________________________
Tony Fabris

Top
#257438 - 08/08/2005 05:57 Re: "HDA: Disk Error: Return_Intr" Message question [Re: tfabris]
tfabris
carpal tunnel

Registered: 20/12/1999
Posts: 31565
Loc: Seattle, WA
Again, my own FAQ answers me:

Tuner presets and bookmarks. I'm moving to Seattle so tuner presets I was gonna change anyhoo. And I don't have any bookmarks I'm worried about.

Okay, gonna try it.
_________________________
Tony Fabris

Top
#257439 - 08/08/2005 06:33 Re: "HDA: Disk Error: Return_Intr" Message question [Re: tfabris]
tfabris
carpal tunnel

Registered: 20/12/1999
Posts: 31565
Loc: Seattle, WA
Okay, it worked. Hm.

Before I FAQ this as I promised, I'll need to run all those SMART tests on the player to be sure the hard disk is in working order.

Incidentally, what indicated a disk problem to me was Hijack telling me "HDA: LOST INTERRUPT" each time I selected a down-down-down shuffle, and subsequent not-remembering-that-playlist-across-reboots. Selecting a smaller playlist would sometimes work without giving me the error and would be fine across reboots. I determined: Bad sector somewhere in the dynamic data partition. Not cable/header trouble because it had no problem reading the song files and playing them back. I reseated the cable, double checked its crimping, double checked the header, and switched the connector anyway to be sure, but no change.

But since it let me zero-out HDA3 without giving me any errors (either on the screen or in the console) and the player seems (cross fingers) to be working perfectly now, then should I assume that the disk is really fine at the hardware level and there was something ELSE wrong I'm not understanding?

Would something like a power failure or a software glitch that causes a reboot, during a dynamic-data-write, cause this kind of error?
_________________________
Tony Fabris

Top
#257440 - 08/08/2005 06:36 Re: "HDA: Disk Error: Return_Intr" Message question [Re: tfabris]
tfabris
carpal tunnel

Registered: 20/12/1999
Posts: 31565
Loc: Seattle, WA
And by the way, Mark...

THANK YOU for all your incredible expertise on this topic. We are truly fortunate to have the "Linux IDE Guy" around helping us all out. You are, and always have been, The Man.
_________________________
Tony Fabris

Top
#257441 - 08/08/2005 09:32 Re: "HDA: Disk Error: Return_Intr" Message question [Re: mlord]
peter
carpal tunnel

Registered: 13/07/2000
Posts: 4172
Loc: Cambridge, England
Quote:
try zeroing the partition first. The player code does single sector I/O usually on the scratch partition, and that isn't good enough since the drive itself only reads/writes tracks.

Ah, good point. FWIW, to zero the saved playlist and the bookmarks, without touching the tuner or EQ settings, or the per-track dynamic data, you need the command dd if=/dev/zero of=/dev/hda3 bs=512 count=2048; if you've done the new set-max-fid hack, you'll also need dd if=/dev/zero of=/dev/hda3 bs=512 seek=2176 count=1920.

Peter

Top
#257442 - 08/08/2005 09:48 Re: "HDA: Disk Error: Return_Intr" Message question [Re: peter]
mlord
carpal tunnel

Registered: 29/08/2000
Posts: 14478
Loc: Canada
Thanks for figuring out those offsets, Peter. I had just returned from a long (11 hours, 10 caches, 1035KM) drive, and my paltry reasoning skills were not functioning yet (and still probably aren't!).

Tony.. as you noted, you still oughta try a S.M.A.R.T. "long" test on it. It might really be bad still, but what you have done often recovers things nicely.

cheers

Top
#257443 - 08/08/2005 13:00 Re: "HDA: Disk Error: Return_Intr" Message question [Re: mlord]
tfabris
carpal tunnel

Registered: 20/12/1999
Posts: 31565
Loc: Seattle, WA
Thanks, I'm curious if the problem could possibly be all-software. I figured the errors I was getting were only possible as hardware errors.
_________________________
Tony Fabris

Top
#257444 - 08/08/2005 13:14 Re: "HDA: Disk Error: Return_Intr" Message question [Re: tfabris]
tfabris
carpal tunnel

Registered: 20/12/1999
Posts: 31565
Loc: Seattle, WA
Oh, and thanks, Peter and Mark, for those command lines. I updated the faq entry on clearing the dynamic data with those.

I'm still reserving judgement about a new FAQ on using the zero-out command as a workaround for bad sectors. Haven't gotten to the SMART tests yet. Still in a transitionary phase in my relocation and don't have time to run the tests just now.
_________________________
Tony Fabris

Top
#257445 - 08/08/2005 16:07 Re: "HDA: Disk Error: Return_Intr" Message question [Re: tfabris]
mlord
carpal tunnel

Registered: 29/08/2000
Posts: 14478
Loc: Canada
Quote:
Thanks, I'm curious if the problem could possibly be all-software.


Not a chance.

Top
#257446 - 08/08/2005 20:52 Re: "HDA: Disk Error: Return_Intr" Message question [Re: mlord]
tfabris
carpal tunnel

Registered: 20/12/1999
Posts: 31565
Loc: Seattle, WA
Well, then I'm not so embarrassed for buying the new hard disk, then.
_________________________
Tony Fabris

Top
#257447 - 09/08/2005 08:00 Re: "HDA: Disk Error: Return_Intr" Message question [Re: tfabris]
pgrzelak
carpal tunnel

Registered: 15/08/2000
Posts: 4859
Loc: New Jersey, USA
Well, as long as the new hard drive has a larger capacity, you can always say you were upgrading...
_________________________
Paul Grzelak
200GB with 48MB RAM, Illuminated Buttons and Digital Outputs

Top
#257448 - 11/08/2005 04:59 Re: "HDA: Disk Error: Return_Intr" Message question [Re: tfabris]
tfabris
carpal tunnel

Registered: 20/12/1999
Posts: 31565
Loc: Seattle, WA
Quote:
Haven't gotten to the SMART tests yet.

Okay, got around to the SMART tests. I don't know how to decipher the results. They indicate errors, but I'm not sure if the errors are serious. The log file is attached. Does anyone know how to tell if the errors are serious?


Attachments
262418-SerialLogofSMARTtests.txt (353 downloads)

_________________________
Tony Fabris

Top
#257449 - 11/08/2005 11:47 Re: "HDA: Disk Error: Return_Intr" Message question [Re: tfabris]
mlord
carpal tunnel

Registered: 29/08/2000
Posts: 14478
Loc: Canada
The logs retain error info long after errors are repaired. The report you posted shows old information.

It also shows you launching a "long" selftest, but I don't see the results of that test afterwards.. did you wait an hour and then redo the "smartctl -l selftest" command? I see one instance of that right after issuing the "-t long" command, but it doesn't show the results of the "-t long", which means the selftest was still happening (or the unit got powered off or something before it completed).

The selftest commands ("-t long") execute *in the background*, and take a while to complete.

Cheers

Top
#257450 - 11/08/2005 12:48 Re: "HDA: Disk Error: Return_Intr" Message question [Re: mlord]
tfabris
carpal tunnel

Registered: 20/12/1999
Posts: 31565
Loc: Seattle, WA
Damn. I thought I'd waited the 55 minutes it told me to wait.

Guess I'll do it over again.
_________________________
Tony Fabris

Top
Page 2 of 2 < 1 2